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Abstract 

One of the main trend in to date research and development is the miniaturization of electronic 
devices. In this perspective, integrated nanodevices based on proteins or biomolecules are attract- 
ing a major interest. In fact, it has been shown that proteins like bacteriorhodopsin and azurin, 
manifest electrical properties which are promising for the development of active components in 
the field of molecular electronics. Here we focus on two relevant kinds of proteins: The bovine 
rhodopsin, prototype of GPCR protein, and the enzyme acetylcholinesterase (AChE), whose in- 
hibition is one of the most qualified treatments of Alzheimer disease. Both these proteins exert 
their functioning starting with a conformational change of their native structure. Our guess is that 
such a change should be accompanied with a detectable variation of their electrical properties. To 
investigate this conjecture, we present an impedance network model of proteins, able to estimate 
the different electrical response associated with the different configurations. The model resolution 
of the electrical response is found able to monitor the structure and the conformational change of 
the given protein. In this respect, rhodopsin exhibits a better differential response than AChE. 
This result gives room to different interpretations of the degree of conformational change and in 
particular supports a recent hypothesis on the existence of a mixed state already in the native 
configuration of the protein. 
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I. INTRODUCTION 



Proteins are at the basis of living systems. In particular, they control the sensing action 
and the correct functioning of the system by operating at a cellular level. In a cell, the 
starting process of this functional control is usually associated with a conformational change 
of the protein in response to the capture of a specific ligand. The conformational change 
activates a sequence of biological mechanisms, which end in the production of a final stimulus 
used by the system to organize its life. The possibility to detect such a conformational 
change through some modification of an electrical property of the corresponding protein is 
of extreme interest from both a fundamental and an applied point of view. 

On the former side, the nature of charge transport in protein jl[ and, in general, in 
biological matter (2|, is a long time explored subject till now not completely understood 0]. 
Most of the models are based on a quantum mechanical tunneling |3J]. On this respect, some 
lean towards a charge transport along the protein backbone 0], others point to a "rest and 
fire" thermal mechanism in which charge is injected only for certain values of the dihedral 
angles (!]. 

On the latter side, applications, the use of proteins, and in general of biomolecules, for the 
construction of nanodevices is one of the leading frontiers in biology, physics and technology. 
In fact, there are various concomitant peculiarities which point to biomolecules as promising 
material for new devices. Among them, we remember their extremely small sizes, their 
specificity, i.e. the ability to react only to a specific stimulus (or a very narrow window of 
similar stimuli), their conductive properties p, LZI], and not negligible, their low cost [8(. 

In this context, the most intriguing problem is to understand how conformational changes 
influence the electrical properties of a protein. At present, most of the information about the 
different structures of proteins comes from X-ray or NMR investigations The polypeptide 
is crystallized (X-ray analysis) and then a map of all its atoms is acquired. Starting from 
these representations it is possible to model the protein by using, either an all-atom scheme 
like in Molecular D yna mics techniques [loj . or an all-amino acid scheme like in Elastic 
Network models (III . 12l . 13l 14j . The former method is surely more accurate but needs for a 
big computational environment, the latter method requires more affordable computational 
resources and anyway is able to catch the relevant features of the polypeptide. Many authors 
have described the topological and statistical properties of this kind of network, also inducing 
physical information in fine accordance with the experimental tests (ill. [l2l. 13. 13 . 



The aim of this paper is to develop an irregular network model, which translates a given 
protein topology into the protein electrical properties. Accordingly, starting from the three- 
dimensional representation of the protein, the model will carry out a comparative analysis 
of its geometrical structures and predict some features of its small signal impedance. Cal- 
culations are specialized to bovine rhodopsin, the prototype of G-protein coupled receptors 
(GPCR), and to the enzyme AChE, whose inhibition is one of the most qualified treatments 
of Alzheimer disease. By means of Nyquist plots, the electrical response of the corresponding 
networks are then investigated for different values of frequency of the applied external field. 

The content of the paper is organized as follows. Section II briefly reviews the theoretical 
approach. Section III reports the results obtained for the two proteins considered here 
by analyzing the topological network and the impedance spectra for the known structures 
available in the literature. Major conclusions are drawn in Sec. IV. Finally, the Appendix 
details the values of the protein equivalent circuit obtained from the simulations. 
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II. MODEL 



Since the seminal paper of Tirion 12] , many researchers have investigated the possibility 



to model proteins by means of appropriate networks |13|, [14[]. This coarse grained approach 
is computationally more affordable than a molecular dynamics procedure 1(| and, never- 
theless, able to simulate the global behavior of a given protein fl2l . [lH 14 |. Within this 
approach, a protein is mapped into a network whose nodes coincide with the coordinates of 
the so-called C a carbon atom pertaining to each amino acid as known from the public data 
base (PDB) [9]. As free parameter, the model contains the interaction (cut-off) radius, R , 
which fixes the maximal distance between two connected nodes: Only nodes with distance 
less or equal to R are connected with a link. This simple model provides a map of the 
protein spatial organization, i.e. a topological network or a graph. It becomes pregnant of 
meaning only when the links take a physical role. This was made, for example, by assigning 
a spring constant to each link and thus studying the network normal modes [HEGl- 

In some recent works 15, 16, 13], by assigning to each link an impedance, we proposed 
a new strategy for the investigation of the protein electrical response. Furthermore, on the 
same ground, also the role of thermal fluctuations was tested [H, 16, 17]. Here we use 



the same strategy by assuming that the amino acids interact electrically among them: The 
electrical charge transfers between neighboring residues (l8| . which can also change their 
electronic polarization [19j. More precisely, in this model each link between the nodes i 
and j corresponds to an elementary RC parallel impedance Zij, with R an Ohmic resistor 
and and C a planar homogeneous capacitor. By assuming that the cross-sectional area of 
resistor and capacitor is equal to the cross-section area of overlapping spheres, the elementary 



impedance becomes [16 
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where Aij = n(R 2 — ^-/4), is the cross-sectional area between the spheres centered on the 
i, j nodes, respectively, is the distance between these centers, p is the resistivity, taken to 
be the same for every amino acid with the indicative value of an insulator p = 10 10 f2 m; % is 
the imaginary unit, i = y/—l, eo is the vacuum permittivity and u is the circular frequency 
of the applied voltage. The relative dielectric constant of the couple of i, j amino acids, qj, 
is expressed in terms of the intrinsic polarizability of the i, j amino acids, «j, ctj 
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where g=4.65 is a normalization constant introduced to obtain values of distributed 
between 1 and 80 (relative permittivity of vacuum and water at 20 °C, respectively). 

By taking the first and last amino acid as injector and collector contacts at the given 
electrical potential difference, the calculation of the total impedance of the network is ob- 
tained by solving the electrical circuit, i.e. we have found the potential on each node and the 
current flowing through each link for a given applied external voltage [2(|. To this purpose 
we made use of the standard procedure of solution of linear networks based on Kirchhoff 's 
laws. More precisely, in the present case of an irregular network with complex topology, it 
was particularly convenient to write and solve the Kirchhoff 's node equations. 

This one-node impedance model, henceforth also called AA model, is here further imple- 
mented by considering also the case of having two nodes for each amino acid, henceforth also 
called AB model. In fact, by identifying the amino acid with its C a , in the one-node model 
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the resulting structure becomes analogous to the polypeptidic backbone. However, in this 
way only the backbone behavior can be reproduced while, in a conformational change, the 
backbone displacement is not the only relevant transformation. For example, one could be 
interested to study the rotations of each amino acid around the backbone [5j . To account for 
these possibilities, we have to depart from the one-node model, and look for a more realistic 
picture. Since the distinctive mark of each amino acid is in its residue, which does not lye 
on the polypeptidic backbone, we found natural to fix on each amino acid a second node. 
Accordingly, as second node we choose the so-called Cp atom, i.e. the second carbon atom 
that attaches to the functional group. The atom is present in all the amino acids with the 
exclusion of glycine. The impedance attributed to the new links arising from the presence 
of this second node is taken of the same form as in Eq. (1). In this way, the total number 
of nodes (u) mapping the protein is practically doubled. To connect the nodes we have 
adopted two choices, namely: The isotropic network and the directed network, this latter 
choice bein g ab le to better exploit directional characteristics in analogy with the directed 
percolation [211 ] . In the isotropic network, the a and j3 nodes are considered to be equivalent. 
Accordingly, each node is connected with all the others inside the interaction radius. Thus, 
for sufficiently large R , each node has v-\ connections. In the directed network, the a and f3 
nodes are not equivalent. Accordingly, for a given interaction radius, each a node, identified 
by the serial number of the protein primary structure, is connected to the f3 node of its 
amino acid and to the a and (3 nodes of amino acids with higher serial number. By contrast, 
each j3 node is linked to (3 nodes pertaining to amino acids with higher serial number. 
Accordingly, for the isotropic network, the maximum value of the total number of links 

is: 

KZopic = (N a + Np)(N a + Np- l)/2, (3) 
while, for the directed network it is: 

NZZtea = (N a )(N a - l)/2 + {N fi )(N p - l)/2 + £E N a Np. (4) 

a 0>a 

Here N a is the number of C a , coincident with the total number of amino acids, N, and Np 
is the number of Cp. We observe that the considered proteins have, respectively, 348 C a 
atoms and 325 Cp atoms (rhodopsin) and 527 C a atoms and 483 Cp atoms (AChE), so we 
can assume Np « N and then deduce the expressions: 

KZo* « 2iV(2iV-l)/2, 

NZ X ect ed « N(3N - l)/2. (5) 

By comparing Eqs. ©, (jlj), © we notice a global different functioning of the isotropic and 
directed networks. In the former network, each node represents an independent unity that 
interacts in the same way with all the other unities by elongating (N a + Np — 1) ~ 2N — 1 
links. In the latter network, each amino acid represents a working unit that interacts with 
other units through only three kinds of links, respectively: C a - G a , C a - Cp, and Cp - 
Cp. The number of links drawn out by each amino acid is ~ 3N — 1. We notice, that 
the isotropic network exhibits a number of links that is in general larger than the directed 
network up to a maximum value of about 25%. 

In what concerns with the contacts to the external bias, in the AA model these are 
positioned on the first and last amino acid. In the AB model we have explored three 
possibilities. Accordingly, as injector and collector nodes we have taken: (i) the first C a and 
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the last Cg, AB a)i g model; (ii) the couple a — (5 of the first amino acid, and the couple a — f3 
of the last amino-acid, AB a/ g ja/ g model; (iii) the first C a atom and the last C a atom, AB a a 
model. 



III. RESULTS 

In the following, we investigate the structural and electrical properties of a protein 
through both the graph model and the impedance network model. Then, we compare 
the electrical responses of a protein before and after undergoing a conformational change by 
using the AA and the AB models. The proteins under test are the bovine rhodopsin in dark 
(native) and light (activated), and the enzyme acetylcholinesterase (AChE) in the native 
form and complexed form with Huperzine A. The former is the most explored structure in 
the class of the GPCR and is widely used as a prototype for deducing the structure and 
function of all the other proteins pertaining to the same class 22, 23[. The latter plays a 



fundamental role in the process of functioning of the muscle cells. It destroys the neuro- 
transmitter acetylcholine after it has passed to these cells the information coming from the 
brain, so that new signals can be transmitted. Starting from the atomic coordinates of 
the given protein, as reported in the public data base (PDB) [§], we first investigate the 
topological properties of the networks as function of the interaction radius. Then, the anal- 
ogous investigation is carried out for the total impedance spectrum of the corresponding 
impedance network. 



A. The topological network 

Bovine rhodopsin and torpedo acetylcholinesterase are quite different proteins, both in 
structure and in functioning. The former is a seven-a-helices transmembrane protein, acting 
as a light receptor. It is able to capture photons, then producing a cascade process that 
starts with the activation of a G-protein and ends with the transmission of information to 
brain. The latter is a globular protein, made of fourteen /3-sheets and sixteen a-helices. It is 
an enzyme that breaks the neurotransmitter acetylcholine into acetic acid and choline, thus 
stopping the transmission of signal from nerve cells to muscle cells. 

At present, in the PDB jof there are 26 entries related to bovine rhodopsin and 56 entries 
related to torpedo acetylcholinesterase. For rhodopsin, only two of the 26 entries describe 
the protein in light, namely: 1LN6, an NMR product, and the most recent 2137, obtained 
with X-ray analysis. None of the above two entries reports the entire sequence of amino 
acids. For the rhodopsin in dark (the native state) there are three complete sequences, 
namely: 1U19, 2G87, 2HPY, all obtained by X-ray diffraction, with different resolution. The 
AChE is reported in many, incomplete, different representations, either native or complexed 
(the activated state) with different molecules (mainly Tacrine, Ri vast igmine, Galantamine, 
Huperzine A/B). 

The first task we address is to asses the level of resolution the present topological network 
model can reach when discriminates among similar (native or activated) representations and 
between the native and activated state. In other words, we would check whether the network 
provides a sensitive map of the protein structure. To this purpose, the analysis is performed 
by using the AA model. 

Figure 1 reports the difference in the number of links between couples of structures of 
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rhodopsin, native (left) and activated (right), considered here as a function of the inter- 
action radius. The native (Rho) and activated (Metall) configurations refer to en gine ered 



structures obtained from the incomplete 1LN6, 1JFP and 1F88 (chain A) [15|, [16|, [17| and 
contain the same number of amino acids (348). The comparison among the three native 
representations of rhodopsin, 1U19, 2G87, 2HPY and the engineered one, Rho, shows that: 
(i) 1U19 and 2G87 exhibit practically the same number of links and, (ii) there are consistent 
differences between the native and engineered configurations. Furthermore, when moving 
from the native to the activated state, we found an increase of the difference in the number 
of links. 

Figure 2 reports the difference in the number of links between couples of native (left) and 
complexed (right) representations of AChE as a function of the interaction radius. Here, the 
model correctly predicts that two distinct chains of the same representation of native AChE 
(lEA5 a /b) keep practically the same number of links independently of the interaction radius. 
On the other hand, representations of native AChE obtained under different experimental 
conditions show significative differences in the number of links, with a maximum value 
around R = 25 A . Some structures of complexed AChE are compared with the native form 
2ACE: The difference in the link number can be positive or negative with respect to the 
structure considered, and the maxima differences are comparable with those between couples 
of native structures. By comparing rhodopsin with AChE, we notice that the values of the 
difference in the number of links and their dependence upon R are very similar, either in 
the native or in the activated state. However, we remark that AChE with its 527 amino 
acids (2ACE) is a protein substantially greater than rhodopsin with its 328 amino acids. As 
a consequence, the relative value of the difference in the number of links is found to be more 
consistent for rhodopsin than for AChE. 

From figures 1 and 2, it is also evident that the representations of a given protein in the 
same state (native or activated) look sometimes comparable with those of representations of 
the same protein in different states. In other words, different experimental conditions may 
produce very different representations of the same state of the protein. Therefore, if the 
network model is used to discriminate between the native and the activated (or complexed) 
state of the protein, then, it is mandatory that the reference representations should be 
produced under the same experimental conditions. For this reason, in the following we 
use the couple Rho - Metall, for native and activated rhodopsin, and 2ACE - 1VOT-2 (X- 
ray products, same experiment) for native and complexed (with Huperzine A) AChE. The 
1VOT-2 structure is the amino-acid sequence 1VOT deprived of two amino acids, ALA536, 
CYS537, which are not present in 2ACE. 

To emphasize the network model ability to catch the protein topology, we have calculated 
the adjacency matrix 24| by representing the links in a x-y plane where the serial number 
of protein amino acids is reported on the x and y axes. Here each link corresponds to a 
point. Accordingly, Fig. 3 reports the map of rhodopsin in dark, Rho: The helix-to-helix 
links are very evident, either for R = 6 A (dark points) or, even better, for R = 12 A (grey 
boxes). We find that the links reproduce the closeness of H2 with H3 and HI, and of H2 and 
H4; furthermore they suggest the presence of H-bonds among H1,H2,H7 and H2,H3,H4 
and also between H3 and H6 and between H6 and H7 j23|. Notice that the helix couples 
(H1-H3), (H1-H4), (H1-H5), (H1-H6), (H2-H5), (H2-H6),(H2-H6), (H4-H7),(H5-H7) are not 
connected for these values of the interaction radius. 

Figure 4 reports the map of AChE. The adjacency matrix has been calculated for R 
= 6 A (dark points), and for R = 12 A (grey boxes). Here we notice an inhomogeneous 
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distribution of links, which is more tangled than in rhodopsin, mainly due to the higher 
complexity of the AChE protein. It is impossible, within a single figure, to report all the 
connections among sheets and helices. Thus, we emphasize only some of the them, the most 
evident, mainly reproducing the closeness between /3-sheets. 

We conclude that the graph analysis of proteins provides a valuable sketch of the force 
connected regions in the protein. In fact, by considering short range forces and using R 
as the parameter describing their cut-off distance, Figs. 3 and 4 identify the interacting 
regions of the protein. The increase of R is equivalent to consider forces with longer range. 
Accordingly, the drawings in Figs. 3 and 4 emphasize the dependence from the interacting 
radius of the connective map of the network. 



B. The impedance network 

By attributing to each link of the network an elementary impedance as given by Eq. (JT]), 
we have calculated the frequency response of the impedance network, Z(f), in the frequency 
range 0-^1100 Hz, and represented it by means of the Nyquist plot. This kind of plot is 
very used in signal processing and in characterizing the electrical properties of biological 
materials appropriately deposited on functionalized gold electrodes [25j. It combines the 
imaginary and real part of a transfer function, in our case the impedance associated with 
the protein, using the frequency as an implicit variable. By means of the Nyquist plot, we 
compare the electrical response of the protein in its native and activated states for different 
values of the interaction radius. 

The AA model contains one free parameter in the value of the cut-off radius R , which 
fixes the number of links and so the network topology. In the limit of R values too small (to 
say R < 6 A), only the nearest neighbours are connected, and so it is not possible to reveal 
the existence of more complex structure like a-helices or /5-sheets. On the other hand, in the 
limit of R values too large (to say R > 80 A ), each node is connected with all the others, and 
so the protein appears as a uniform structure. A value of R that is relevant for our purposes 
should be that which enables the main structures of the protein to emerge clearly. In fact, 
we are interested in detecting if and how they displace in the protein conformational change. 
Accordingly, we look for a value of R best revealing the main structures of protein but also 
emphasizing the differences between the activated and the native state of the protein. For a 
GPCR, a relevant value of R is a compromise between the characteristic dimension of the 
a-helices and the typical distance among a-helices, say D. When rhodopsin goes from the 
quiescent to the active state, its a-helices change their relative distance and D goes in D'. 
In the corresponding network, when the value of R is between D and D', a huge number 
of links change their value, so well revealing the conformational change. We call D the 
effective distance" , and use it as a reference length value. 
In order to explore the different topologies associated with the changing of R , in the frame 
of the AA model, we have evaluated the network degree distribution, i.e. the distribution of 



i; 



the connected nodes [24j . The results of calculations are reported in Fig. 5 for rhodopsin and 
in Fig. 6 for AChE. Here we observe that for R < 9 A , both for rhodopsin and AChE, the 
degree distribution remains substantially peaked around the same degree value, i.e., there 
is a single characteristic dimension of the network clustering. It corresponds to the nearest 
neighboring domain (k=6 for AChE, k=7 for Rho). The cluster dimension grows for enlarged 
R , until the value R =12 A for rhodopsin and R =9 A for AChE. We notice, indeed, that 
for R = 12 A the degree distribution of rhodopsin exhibits two prominent maxima at k = 25 
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and 37, respectively (see Fig. 5), and so we have two different clusterizations. On the other 
hand, for the same value of R , AChE exhibits a degree distribution randomly spiked in the 
range 20 < k < 45. For values of R in the range 12 25 A , we have found a spreading of 
the distribution, which exhibits a series of spikes representing the fingerprint of the tertiary 
structure of the given protein. For values of R > 25 A , the degree distribution is found to 
shrink (see insert in Fig. 6) and, at R = 80 A , all the nodes are found to be practically 
connected each other. Here the degree distribution takes a delta-like shape centered at 
k=(z/-l). We conclude that, R = 9 A , for AChE, and R = 12 A, for rhodopsin, should be 
taken as optimal values to obtain the best resolution of the intimate protein structures. 

From the above considerations, in the following we discuss four possible cases, in which the 
interaction radius R , and the effective distance D combine to produce different resolutions 
for the AA and AB directed model, respectively. (Notice that the effective distance for the 
AB model has been assumed larger than that for the AA model because of the finite size of 
the amino acid.) 

I. D AA ~ R, D AB > R, D' AA > R, D' AB > R 

II. D AA < R, D AB ~ R, D' AA > R, D' AB > R , , 
///. D AA < R, D AB <R, D' AA <R, D' AB >R W 
IV. D AA < R, D AB < R, D'aa < R, d 'ab < R 

Keeping in mind that the condition D < R produces links and that, on the contrary, 
D > R does not, the preceding cases are analyzed as follows: 

Case I. Here the AA model discriminates different protein states better than the AB 
model. 

Case II. Here both the AA and the AB models are able to resolve well the two config- 
urations. In particular, the AA model is more sensitive to the change of the interaction 
radius. 

Case III. Here the AA model discriminates different protein states worse than the AB 
model. 

Case IV. Here it is rather difficult to discriminate the configurations both for the AA 
and the AB models since the number of links remains practically the same in both the 
configurations. 

The general trends discussed above can be quantitatively assessed for the proteins under 
test by selecting a significative set of R values for rhodopsin and AChE, respectively. Ac- 
cordingly, for rhodopsin we take the R values 6, 12, 25, 50 A , and for AChE the R values 
6, 9, 12, 25 A . The results of calculations are reported with an accuracy of three digits (see 
Appendix), which is considered appropriate for an experimental validation of the model. 

Figures from 7 to 9 report the Nyquist plots of the global network impedance, normal- 
ized to the value at zero frequency, Z(0), for the case of the engineered representations of 
rhodopsin in the native state, Rho, and in the activated state, Meta II. In all the figures 
the AA model is compared with the AB models by adopting the same convention for the 
symbols. In each figure the plots corresponding to increasing values of R are indicated as 
(a), (b), (c), (d) in clockwise orientation. 

As a general trend, the shape of the Nyquist spectra remains quite close to that of 
a semicircle, typical of a single RC parallel impedance, except for small but significant 
deviations from the semicircle when R = 6 A . 

Figure 7 shows the different impedance responses obtained with the AA model (tiny 
continuous line for Rho and dotted line for Metall) and with the AB a a directed model 
(bold continuous line for Rho, and dashed line for Meta II). 
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Figure 8 shows the Nyquist plots for the AA model and the AB a/3tCe/3 directed model, 
respectively. 

Figure 9 shows the Nyquist plot for the AA model and the AB isotropic model. 

For R = 6 A, in all the cases the AA model exhibits a resolution between different 
configurations better than the AB models. Thereby, the value R = 6 A pertains to case I: 
This value is a good choice but not the best. 

The value of R — 12 A provides the largest difference between the activated configuration 
and the native one. Furthermore, we find that the AB a Q directed model (see Fig. 7), and 
the AB Q/ g Q/3 directed model (see Fig. 8) increase the differences, respectively of 7%, and 5% 
with respect to the AA model. The value R = 12 A pertains to case II: The best value for 
the AA model, a very good value for the AB directed model. 

For R = 25 A , the difference between the configurations starts to decrease for both the 
models, even if the directed AB model still exhibits a resolution increment with respect to 
that of the AA model of 7%, and 3% for its versions, AB a>a and AB Q/3iQ/3 , respectively. The 
value R = 25 A pertains to case III: For the AB models, this value of R is still a relevant 
one. 

Finally, for R = 50 A , the directed AB model exhibits a resolution increment with respect 
to that of the AA model, of 3%, and 1% for its versions, AB a a and AB a ^ a ^, respectively. The 
ability to resolve is decreasing but it remains significant for both the models. Accordingly 
the value of R =50 A is on the boundary between case III and case IV. 

Figure 9 reports the comparison between the AA and the isotropic AB model. We can 
observe, that for R > 6 A the compared models give practically the same results, in other 
words, the AB isotropic model does not improve the AA model. This outcome says that 
the AB isotropic model does not give to the C/3S an active role, unlike the AB directed 
model. With respect to the directed model it contains much more links, many of them 
slightly varying in the conformational change. This excess of (invariant) links hides the 
small differences between the native and active states. So, while the larger effects due to C a 
displacements can emerge once again, the smaller improvements due to the Cp displacements 
cannot be appreciated. 

By performing the same investigation for the case of AChE, we noticed that the differ- 
ences between the native and activated configurations are significantly smaller than those of 
rhodopsin. In the AA model, with R = 6 A we have found a difference of only 6 %, while, 
for rhodopsin it was of 16 %. Furthermore, this difference decreases at increasing R , unlike 
the case of rhodopsin. This implies a low level of resolution between the configurations, even 
for the most sensitive AB directed model. Accordingly, in Figs. 10 and II we report only 
the comparison between the AA model and the directed AB Q)/3 model, which among the AB 
models exhibits the best resolution. 

From Fig. 10, we observe that for R = 6 A the AA model shows a resolution higher 
than that of the AB model, as expected in case I. For R = 9 A the difference between the 
native and activated configuration is less than 0.1 % in the AA model and it is of 2 % in 
the directed AB model, as expected in case III. Figure 10 (c) emphasizes this difference. 

Figure 11 reports the Nyquist plot of AChE for R = 12 A and 25 A, respectively. 
We notice that in both cases the AA model is no longer able to resolve the native from 
the activated configuration, while the directed AB model resolves a difference between the 
configurations of 2 % for R = 12 A , as expected in case III. For R = 25 A , also the directed 
AB model is no longer able to resolve the difference between the configurations, as expected 
in case IV. 
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In Figs. 7 to 10, it is shown that for R = 6 A , the Nyquist plots take shapes, which are 
slightly squeezed and asymmetric semicircles instead of the perfect semi-circle pertaining 
to a single RC parallel circuit (see Appendix for details). However, by increasing the value 
of R , the Nyquist plots better and better approach the perfect semi-circle. The above 
peculiarities can be satisfactorily interpreted in terms of the Cole-Cole function 26[ with 



one fitting parameter. Accordingly, to interpret the Nyquist plot, we use the normalized 
dimensionless response function |27| : 



I u = (Z(lu) - Z oo )/(Z(0) - Z^) (7) 
which, in our case, reduces to Z(u)/Z(0). The Cole-Cole fitting function is: 

L = z , l N1 , < a < 1 (8) 

which leads to the following relation between the real and imaginary parts of I w 

W*) - \f + W-) - \f = \(i - «) + \ (9) 

with 

a = cos(ira/2) 

b = sm (7ra/2) 

c = 1 + (cjr) 2(1 - a) + 2{ury- a b 

where 1/r = lom is the frequency value corresponding to the maximum value taken by 
—X(I U ) as function of TZ{I U ) (see Appendix). 

Figure 12 reports the fitting of the Nyquist plot for 2ACE, with R = 6 A in the AA 
model, obtained with the Cole-Cole function [26[ and a = 0.09 together with the ideal semi- 
circle shape corresponding to a = 0. The fact that for R > 6 A the Nyquist plots take the 
ideal semi-circle shape is explained by the predominance increase of parallel with respect to 
serial connections. Thus, the network is no longer able to resolve the single relaxation times 
pertaining to each RC link but exhibits an average time constant. 

The Cole-Cole function is one of the most used fitting functions in relaxation processes 
deviating from the Debye-Maxwell behavior [28]. It is not the unique, of course, due the 



complexity of the possible origins of this deviation |27l . |29| . On the other hand, it has 
been shown that its meaning is more vast than simple fitting function [30]. In fact, while 
the power spectrum \Iu>\ 2 associated with the Debye-Maxwell function is Lorentzian, and 
the correlation function is exponential, the power spectrum associated with the Cole-Cole 
function is a more tangled object. It reduces to the Lorentzian distribution for a = 0, 
furthermore, it goes like (cot)^ 1 ^^ for cur ^> 1 and like (1 + 2(wr)( 1_a )sm(7ra/2_)) _1 for 
lot 1. The corresponding correlation function is the Mittag-Leffler function [3l| which 
interpolates between a stretched exponential pattern ( tor 3> 1) and an inverse power law 
decay {lot 1). The exponent of both the functions is the same, 1-ot. 

For completeness, in the Appendix we report the single resistance and capacitance values 
corresponding to the calculated Nyquist plots. 
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IV. CONCLUSIONS 



We have carried out a systematic analysis of protein modelling by means of a coarse 
grained network approach. As application we considered the case of bovine rhodopsin, the 
prototype of the GPCR family, and the case of AChE enzyme, whose inihibition is one of 
the most qualified treatments of the Alzheimer disease. 

By using the topological features of the network, some relevant PDB entries have been 
compared in terms of the number of links as function of the interacting radius. We notice 
that the network map of the protein is able to distinguish among different PDB entries, and 
also to reproduce, with a fine choice of the cut-off R , some topological properties of the 
protein structures. We conclude that the network approach is a suitable tool to discriminate 
among different protein structures. 

By using the features of the impedance network associated with the topological one, we 
have investigated the dynamic electrical response of the proteins through the Nyquist plot 
representation. Accordingly, we predict for the rhodopsin an electrical response which is of a 
quite detectable level (up to difference of 22 % when passing from the native to the activated 
state). Furthermore, a significative conformational change is identified both with the one- 
node AA model and with the two-nodes AB models, the latter foreseing significantly larger 
differences among the configurations. These results are supported by some experimental 



evidences [25[ : Bovine rhodopsin and the rat olfactory receptor 17 has been immobilized on 
a gold electrode building up a self-assembled multilayer. The electrochemical characteristic 
of these structures has been performed in a standard electrochemical cell, and has shown 
Nyquist plots qualitatively similar to those obtained in the present paper. 

Finally, we have found that the electrical responses of bovine rhodopsin and torpedo 
AChE are quite different, with regard to the possibility to distinguish between the activated 
and native state. For the couple 2ACE-1VOT-2, the maximal difference is of 6 %, and it is 
obtained for a small value of the interaction radius R = 6 A . We conclude that such a small 
sensitivity is due to an effectively small difference between the two states. Furthermore, 
since the maximal resolution between the native and the activated state is for small values 
of R , the conformational change acts only among nearest neighboring. However, this is 
not the only possibility. In fact, in a recent work on a particular enzyme, the cyclophilin A 
(CypA) [32|, it has been shown that also in the native form, the enzyme lives part in the 
activated and part in the native state. In the activated state, the percentage of activated 
configurations simply increases. According to this conclusion, also for AChE, our results 
should demonstrate that the crystallographic image of native AChE is a mix of the native 
state and of the Huperzine A (in our case) state, and therefore its conformational is of weak 
relevance. 
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APPENDIX 



The frequency response of the impedance network presented here can be modeled to a 
good degree of approximation, by the impedance of a single RC parallel circuit 

Z«7= f =^ + g, (A.l) 
1 + iujKC 

where TZ, X indicates the real and imaginary part of Zrc, respectively. It is convenient to 
normalize the impedance, in order to better compare results coming from different proteins 
and boundary conditions, so we introduce 

z RC = ^ = n + ti, (A.2) 

with R the impedance at zero frequency. Notice that the function ( 1A.2I) is a first order 
function, i.e. it has only one pole in frequency and it is stable. The maximum of — X occurs 
for — X = 1Z = 1/2 and is obtained for u = ujm = 1/RC. 

In the Nyquist plots calculated for the impedance networks, to account for the non-ideal 
semicircle shape, we have determined the frequency u* at which —X = 1Z and defined an 
effective capacitance, C*, by the relation u* = 1/RC*. The difference between the values 
of C and C* is a signature of the deviation of the Nyquist plot from the perfect semi-circle 
shape. The values of R, C, and C* corresponding to the Nyquist plots reported in Figs. 7, 
8, 10, 11 are reported in Table I (rhodopsin) and in Table II (AChE). 
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TABLE I: Resistance and capacitances of the RC single-impedance circuit equivalent to the protein 
impedance network. Model AA is compared with the AB Q ( g and AB Q ^ Q ^ models for R =6,12,25,50 
A . Rhodopsin is the analyzed protein 





AA Model 


AB a a Model 


AB afta/3 Model 


R /Type 


R (PO) 


C(fF) 


C*(£F) 


R(Pfl) 


C(fF) 


C* (£F) 


R(Pfl) 


C(fF) 


C*(£F) 


R =6A 
Rho 

Meta 


9.49 10 3 
1.13 10 4 


1.53 10~ 4 
1.21 10" 4 


1.32 10~ 4 
9.31 10" 5 


4.72 10 3 
5.37 10 3 


3.26 10~ 4 
2.66 10" 4 


2.82 10~ 4 
1.86 10" 4 


4.47 10 3 
5.01 10 3 


3.60 10~ 4 
3.07 10" 4 


3.19 10~ 4 
2.50 10" 4 


R =12A 
Rho 

Meta 


2.51 10 2 
3.22 10 2 


5.37 10~ 3 
3.57 10" 3 


5.16 10~ 3 
3.41 10" 3 


1.29 10 2 
1.81 10 2 


9.69 10~ 3 
5.51 10" 3 


9.69 10~ 3 
5.01 10" 3 


9.07 10 1 
1.24 10 2 


1.47 10~ 2 
9.35 10" 3 


1.43 10~ 2 
8.93 10" 3 


R =25A 
Rho 

Meta 


1.67 10 1 
2.04 10 1 


6.90 10~ 2 
5.38 10" 2 


6.90 10~ 2 
5.38 10" 2 


1.07 10 6 
1.42 10 6 


1.04 10" 1 
7.04 10" 2 


9.82 10~ 2 
7.04 10" 2 


5.99 
7.06 


1.99 10- 1 
2.83 10" 1 


1.96 10- 1 
2.83 10" 1 


R =50A 
Rho 

Meta 


2.31 
2.72 


4.32 lO" 1 
3.34 10" 1 


4.32 10" 1 
3.34 10" 1 


1.61 
1.97 


5.65 10- 1 
4.24 10" 1 


5.65 10- 1 
4.24 10" 1 


0.82 
0.98 


1.21 
9.30 10" 1 


1.21 
9.30 10" 1 
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TABLE II: Resistance and capacitances of the RC single-impedance circuit equivalent to the protein 
impedance network. Model AA is compared with the AB aja and AB Q/ g Q/ g models, for R =6,9,12,25 
A . AChE is the analyzed protein 





AA Model 


AB a /3 Model 


R /Type 


R(Pfi) 


C(fF) 


C*(fF) 


R(Pfi) 


C(fF) 


C* (£F) 


R =6A 
2ACE 
1VOT-2 


1.51 10 4 
1.43 10 4 


7.23 10~ 5 
8.74 10" 5 


6.00 10~ 5 
6.99 10" 5 


3.63 10 3 
3.50 10 3 


3.93 10~ 4 
4.08 10" 4 


3.67 10~ 4 
3.57 10" 4 


R =9k 
2ACE 
1VOT-2 


1.45 10 3 

1.46 10 3 


1.11 10~ 3 
1.10 10" 3 


1.01 10~ 3 
1.07 10" 3 


5.76 10 2 
5.88 10 2 


2.89 10~ 3 
2.84 10" 3 


2.67 10~ 3 
2.62 10" 3 


R =12A 
2ACE 
1VOT-2 


3.57 10 2 
3.57 10 2 


5.10 10~ 3 
5.10 10" 3 


5.10 10~ 3 
5.10 10" 3 


1.42 10 2 
1.45 10 2 


1.28 10~ 2 
2.51 10" 2 


1.28 10~ 2 
2.51 10" 2 


R =25A 
2ACE 
1VOT-2 


2.08 10 1 
2.08 10 1 


9.80 10~ 2 
9.63 10" 2 


9.80 10~ 2 
9.63 10" 2 


9.71 
9.76 


2.07 10" 1 
2.05 10" 1 


2.06 lO" 1 
2.05 10" 1 



15 



a) 



n 
E 

3 



o 



o 

1— 



800 


i | i | i | i 




MdllVC _ 


400 


L rhodopsin 


o| 




-400 


^* 




r w 


-800 




1200 


i , i , i 



25 50 75 100 



1500 

500 


-500 
-1500 
-2500 



b) 



Activated 
rhodopsin 




25 50 75 100 

R(A) 



FIG. 1: Link number difference (LND) for different configurations of rhodopsin versus the interac- 
tion radius. All the configurations contain the same number of amino acids. Figure 1 (a): Native 
rhodopsin. Stars refer to the LND between the a-chain of 1U19, lU19a and the engineered repre- 
sentation of native rhodopsin, Rho. Open triangles refer to the LND between the a-chain of 2G87, 
2G87a and Rho. Full circles refer to the LND between the a-chain of 2HPY, 2HPYa and Rho. 
Open diamonds refer to the LND between the a-chains 2HPYa and lU19a. Continuous line refers 
to the LND between 2G87a and lU19a. Figure 1(b): Activated rhodopsin. Continuous line refers 
to the LND between the engineered representation of rhodopsin in light, Metall, and Rho. Open 
circles refer to the LND between 2137 and Rho. Open triangles refer to the difference between 
1LN6 and Rho. In the second and third case the Rho sequence has been deprived of the amino 
acids that are not present in 2137 and 1LN6, respectively. 
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25 50 75 100 25 50 75 100 

R(A) R(A) 

FIG. 2: Link number difference (LND) for different configurations of AChE versus the interac- 
tion radius. All the configurations contain the same number amino acids. Figure 2(a): Native 
configurations. Tiny continuous line refers to the LND between two chains of the 1EA5 represen- 
tation: 1EA56, lEA5a. Bold continuous line refers to the difference between lW75a and 2ACE. 
Open circles refer to both: the LND between lEA5a and 1W756 and the LND between 1EA56 and 
1W756. Figure 2(b): Complexed configurations. Tiny black line refers to the LND between 2ACE 
and 1ACJ. Bold black line refers to the LND between 2ACE and 1ACL. Open squares refer to the 
LND between 2ACE and 1AX9. Open circles refer to the LND between 2ACE and 1VOT. Open 
triangles refer to the LND between 2 ACE and lGPKa. 
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FIG. 3: Adjacency matrix of native rhodopsin, Rho. The x,y axes report the sequential number 
of the amino acids. Black full circles refer to R = 6 A , and grey open squares to R = 12 A . Each 
circle/square correspond to a link between the couple of amino acids (x,y). The main domains 
associated with the connections among closest helices are explicitly indicated. 
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FIG. 4: Adjacency matrix of 2ACE. The x,y axes report the sequential number of the amino 
acid. Black full circles refer to R = 6 A , and grey open squares to R = 12 A . Each circle/square 
corresponds to a link between the couple of amino acids (x,y). Dashed lines are in correspondence 
with the three amino acids of the active site. 
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FIG. 5: Degree distribution of native-rhodopsin (rho) network for increasing values of the inter- 
action radius R . 
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FIG. 6: Degree distribution of the native-AChE network for increasing values of the interaction 
radius R . The inset reports the degree distribution for R =25 A (solid line), R =50 A (dotted 
line), R =80 A (bold dashed line). 
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FIG. 7: Nyquist plots of the network impedance associated with the protein. Native rhodopsin is 
compared with activated rhodopsin. The protein sequences are the engineered Rho (native) and 
Meta II (activated). For the AA model: Continuous tiny lines refer to Rho, dotted lines refer to 
Meta II. For the AB a)Q , directed model: Bold continuous lines refer to Rho, dashed lines refer to 
Meta II. Plots are reported for increasing values of the interaction radius in the range from 6 A to 
50 A following a clock-wise orientation. 
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FIG. 8: Nyquist plot of the network impedance associated with the protein. Native rhodopsin is 
compared with activated rhodopsin. The protein sequences are the engineered Rho (native) and 
Meta II (activated). For the AA Model: Continuous tiny lines refer to Rho, and dotted lines refer 
to Meta II. For the AB a/ 3 )a/ 3 directed model: Bold continuous lines refer to Rho, and dashed lines 
refer to Meta II. Plots are reported for increasing values of the interaction radius in the range from 
6 A to 50 A following a clock-wise orientation. 
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FIG. 9: Nyquist plot of the network impedance associated with the protein. Native rhodopsin is 
compared with activated rhodopsin. The protein sequences are the engineered Rho (native) and 
Meta II (activated). For the AA Model: Circles refer to Rho, and dotted lines refer to Meta II. For 
the AB isotropic model (with in-contact on the first C a carbon atom and out-contact on the last 
C a carbon atom): Bold continuous lines refer to Rho, and dashed lines refer to Meta II. Plots are 
reported for increasing values of the interaction radius in the range from 6 A to 50 A following a 
clock-wise orientation. 
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ReZ(co)/ReZ 2ACE (0) ReZ(co)/ReZ 2ACE (0) 

FIG. 10: Nyquist plot of the network impedance associated with the protein. Native AChE, 
2ACE, is compared with AChE complexed with Huperzyne A, 1VOT-2. For the AA model: Tiny 
continuous lines refer to 1VOT-2, dotted lines refer to 2ACE. For the AB Q / g directed model: Bold 
continuous lines refer to 1VOT-2, and dashed lines refer to 2ACE in the directed AB Q a model. In 
Fig. 9(a) R = 6 A , while in Figs. 9(b) and 9(c) R = 9 A ; Fig. 9(c) is a zoom of Fig. 9(b). 
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FIG. 11: Nyquist plot of the network impedance associated with the protein. Native AChE, 
2ACE, is compared with AChE complexed with Huperzyne A, 1VOT-2. For the AA model: Tiny 
continuous lines refer to 1VOT-2, dotted lines refer to 2ACE. For the AB Qj/ g directed model: Bold 
continuous lines refer to 1VOT-2, and dashed lines refer to 2ACE in the directed AB Q $ model. In 
Figs. 10(a) and 10(b) R = 12 A , and Fig. 10(b) is a zoom of Fig. 10(a). In Fig. 10(c) R = 25 A . 
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FIG. 12: Nyquist plot of the 2ACE for R = 6 A , with the Cole-Cole distribution. Stars refer 
to data from simulations, the dotted line refers to the Debye distribution function, the continuous 
line refers to the Cole-Cole distribution function with the best fit parameter a = 0.09. For all the 
distributions the value r =1.12 s is used. 
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